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1 Abstract 
CN ' 

In the paper the step-by-step principles for making local model of electron density are described, 
i— i, They are based on modulation principle - electron density dependence on time is a product of a 

■ number of temporal variations caused by solar radiation, magnetic activity, Earth orientation and 
Oh' unknown additional periodical processes (not a sum, as they suppose sometimes when making such 

models). A multiranges modulation principle is also suggested, that allows automatically extend the 
' set of parameters by using new ones, obtained by filtration (or averaging) of basic set of parameters 

over the time. In the paper we describe two approaches to the model creation - descriptional and 
^ ■ predictional ones. 

To test the approach three different models were created for daily electron density logarithm using 
' ^ , the described principles. We have used the data of Irkutsk digisonde over the period 2003-2007 years 

for testing. It becomes clear that a non-optimal choice of the number of model parameters could 
<**] ' increase prediction error, inspite the error over the set, used for analysis, will decrease. It is shown 

Oh that one year prediction has accuracy about 9-23% depending on the height, and the highest error 

corresponds to the height about 200km. From the modelling we could also see that with increasing of 
I \ the height the number of parameters increases, and this could be caused by inaccuracy of the model 

■ or by not taking additional physical mechanisms into consideration. 

*n : 

O 1 Introduction 

-xf ; 

^vq One of the standard approaches to the building of local models is regression model for electron density 

t-H 1 |Holt et al,2"002l : 

o : 

O N(r, t) = a(r, t) + b(r, f)/io. 7 (f) + c(r, t)Ap(t) + d(r, t)f w . 7 (t)Ap(t) (1) 

Inspite of the fact that the model is very simple - it is just an interpolation of arbitrary function 
(electron density) as a Taylor sequence using /10.7(f), Ap{t) as small parameters, it gives a good agreement 
with the experiment. But it does not depend on electron density creation mechanisms, and due to 
this it might have sometimes a weak prediction accuracy, inspite it could describe the data used for 
approximation pretty accurately. In some approaches to increase the model accuracy one could take into 
account a dependence on averaged values of the parameters /10.7(f), -Ap(f) too, for example on /10.7(f), 
averaged over 81 and 3 days. 

In the work I tried to suggest a way to build a regression local model with some physycal principles 
included. 

2 Modulation principle as a basis for the model. 

At first let us recall that electron density layers formation is mostly caused by solar radiation, and the 



effective radiation wavelength depend on the height |Hargreaves, 1992 . Also, these mechanisms depend 



on Earth orientation and cause a number of additional modulation effects. In the article we will analyze 
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electron density at a selected height, the other heights could be calculated independently using the same 
approach. 

At first let us analyze the modulation effects causing electron density variations at a selected height. 

The first effect is a modulation of solar radiation due to solar activity changes. These changes could 
be quantitatively characterized by solar sunspot number (W - Wolf numbers ) or by /10.7 index - solar 
radiation at f 0.7 cm wavelength. Inspite of ionizing radiation and /10.7 has different wavelengths (lnm 
and lOOnm for D-layer, l-10nm for E-layer, 10-100nm for F-layer), it is clear that /10.7 is a more correct 
parameter than W due to the fact that it could be defined more correct. 

The second modulation effect is electron density dependence on zenith angle of the sun as a function 
of daytime, day of year, longitude and latitude. This dependence controls the ionization processes and 
forms (together with some other mechanisms) the electron density dependence on height and time, also 
well-known as Chapmen layer [Chapmen , 1 93 1| . That is why the zenith angle is also well defined parameter 
that must be included into the model. In this work we do not analyze hourly variations of the electron 
density and due to this we use only position of the Earth at orbit as a basic parameter, which defines 
day-to-day variations of the zenith angle (at given latitude, longitude and height): 

Ded(t) = -0.40915 * cos{2%{DayNo{t) + 8) /365.25) (2) 

where DayNo(t) - day number within a year. 

We do not analyze electron density variations with periods less that 1 day so the usage of this 
parameter looks correct to us. For building the model that includes variations with periods less than a 
day this parameter should be changed or new parameter should be added to characterize hourly variations 
of electron density. 

The third parameter, traditionally included into models, is a magnetic activity index. Exact depen- 
dence of electron density on magnetic activity is not clear, but we suppose that it also modulates electron 
density. 

The forth modulation parameter includes all the hidden periodics that was not included into our 
model by previous 3 parameters. They might exist and be caused by different periodical changes in 
the ionization processes. There is a number of such potential processes, for example changes of neutral 
athmosphere, moon effects, etc. We do not know exact frequencies uJi and phases ipi of these periodics, 
so their number, phases and periodics should be found from data analysis. 

Finally the model of electron density could be written in a form: 

n 

Ne(t)=C (f w .7(t))*Ci(Ap(t))*C 2 (Dacl(t))* fl C^cos^t + (3) 

i = 1 

where C n (x) - some positive functions, and II - product sign. 

3 On choice of the basis functions. 

When positive function is a product of positive modulation functions (electron density is a positive 
function by definition) it is useful to analyze logarithm of this function as a sum of some functions, each 
of them defines its own modulation effect independently, i.e. these modulation functions work as a basis: 

n 

log(Ne)=D (f w . 7 (t))+D 1 (A P (t)) + D 2 (Decl(t))+ E D i+i (cos(wit + <&)) (4) 

i = 1 

where 

D i (x)=log(C i (x)) (5) 

One of well known applications that uses this approach is the so-called cepstral analysis |Gonorovskij,1986| 
- analysis of the logarithm of the function as a sum of periodical functions. Ideologically our technique 
is close to the cepstral analysis, but we use some non-peridical (but lineary independent) functions in 
addition to the periodical functions. 

Let us define a structure of the unknown functions Di from some basic physical principles. 

The structure of the D (x) is clear: due to the ionising component of the radiation wavelength differs 
from characterstic index /10.7 wavelength and most of the spectras in a wide region could be approximated 
by power low, the electron density could be in first approximation proportional to the power of the /10.7: 
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Ne(t) = Coi(ho.7(t)f 02 (6) 

where Coi,Co2 - some parameters. Inspite of solar spectrum shape could depend on solar activity 
level, at first approximation we could suppose that its shape remains constant and only its amplitude is 
changed with changing /10.7. This automatically means that in the absence of the solar radiation the 
electron density becomes zero, this suggestion does not contaminate with basic physical principles. 

So the Dq(x) becomes: 

A>(/io.r(«)) = Cox + C 02 log{f l0 . 7 {t)) (7) 

Lets also suppose that dependence on magnetic activity is also defined by power low (we do this 
analogous to the solar activity dependence and this could not be explained from physical point of view) : 

D^Ap{t)) = C n + C 12 log(Ap(t)) (8) 
Let us suppose that the dependence on daily variations has an exponential character: 

Ne(t) = C 21 exp{C 22 Decl{t)) (9) 

due to zenith angle exponentially defines electron density for Chapmen layer at a selected height 
|Chapmen,i93"T1 : 

Ne(t) = Aexp(-B — ] --) (10) 
cos(x) 

or, according to |Shimaza ki,1959| by extracting the dependence on zenith angle at local noon xo- 

Ne(t) = Aexp{-B ° 0s{ -} } ) (11) 
cos{x) 

where 

cos(x) = sin(LAT) ■ sin(Decl(t)) + cos(LAT) ■ cos(Decl(t)) ■ sin((XLT(t) - 6.) • 0.2617994) (12) 
XLT(t) - local solar time. 

So D 2 (x) could be written (in a first approximation, taking into account only linearized part of 
argument in ifTTj) ) as: 

D 2 {Decl{t)) = C 21 + C 22 Ded(t) (13) 
Other periodical functions are just linear ones: 

D 3+i (x) = C 3 ,ix (14) 
By grouping all these together we obtain the following model for electron density: 

Ne log (t) = log(Ne(t)) = C + C 02 log(f w . 7 (t)) + C 12 log(Ap(t)) + 



n 

-C 22 Decl(t) + E C 3+t ^i(cos(uj t t + tpi)) 
i = 1 



(15) 



where 

C = C01 + Cn + C 2 i (16) 

Actually, the ionosphere is pretty innertial system so its responce to the disturbancies of different 
periods could be different. Within the suggested approach we could suppose that different periods of 
disturbancies cause an independent modulation (we will call this multiranges modulation principle). Let 
us suppose that in the electron density time dependence we could find some well-defined response time: 
1 day as Earth rotation period, 3 days as average disturbance period and 30 days as approximate solar 
rotation/moon rotation period. The choice of the number and exact values of these periods is not principal 
for now and is used for illustration of the method. 

Lets define logarithm of electron density Nei og {t) as a sum of functions with given periods of changes: 

Ne log (t) = Ne logt<ld {t) + Ne log ^ [ld ^ d] (t) + Ne log , [3d ^ 0d] (t) + Nei og , >30d (t) (17) 
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The principle of making these functions Nei ogt< i d (t), Nei og> [x d<3d ], Nei ogt [ 3dj30d ](t), Nei og , >30d (t) from 
the source function Nei og (t) is not important. It is more important that the sum is exactly Nei og (t) and 
these functions have necessary time changes (jTTJ) . One of the most simple methods is using the time 
averaging or using niters with rectangular bands for almost all the functions (l-3days period, 3-30days 
period and more than 30days period). The last function (for example with 0-1 days periods) is denned 
to the condition lfl7| becomes true. 

Each of the components is approximated by the sum (fTo|) : 

Neiog, T (t) = C T + C 2,Tlog(fio. 7 (t)) + C 12 , T log(A P (t))+ 

+C 22 , T Decl(t) + £ C Wtl , T (cos(wit + <pi)) (18) 
i = 1 

where index T defines period ranges of corresponding variations: <ldays, [lday,3days], [3days,30days], 
>30days 

The resulting model for the electron density logarithm is the following: 



Ne log (t) = C+ £ C 2,Tlog T (fw.7{t)) + £ C 12 ,Tlog T {Ap{t))+ 
T T 

N W 
+C 2 2Decl(t) + £ C 3+i _i(cos(uJit + tpi)) 
i = 1 

where the sum is calculated over all the period ranges. The index T means applying the same nitration 
technique as used by us for selecting electron density periods to the argument functions: 

log(x) = log <ld (x) + log [ld - 3d] {x) + log [3d ^ 30d] (x) + log >30d (x) (20) 

Decl(x) = DecZ<i d (a;) + Decl[ ld . 3d ](x) + Decl[ 3d . 3Qd ](x) + Decl >3Qd (x) (21) 

So, according to this multiranges modulation principle we should choose important ionospheric re- 
sponse times and make the extended set of arguments by filtration of all the indexes with some filtration 
technique. 

In the paper we will analyze only variations with periods no less than a day. In this case the model 
l[T9|) becomes the following: 



Ne logi > ld (t) =C+ ZC 2,Tlog T (fw.7(t))+ £ C 12> Tlog T {Ap{t))+ 
T T 
N 

+C 2 2-Ded(t) + E C 3+ i,icos{u)it + ipi) 
i = 1 



(22) 



From right and left parts we have removed all the effects with periods less than a day. 
Shortly, let us describe the model ((22]): 

1) According to the filtration rule {T7|) we should average logarithm of electron density at given height 
for whole the day. This corresponds to the geometrical averaging of the electron density with following 
calculation of the logarithm. 

2) According to the filtration rule lfT7|) we should create extended set of indexes with period ranges 1 
day,l-3days,3-30days and more than 30days periods from basic set of paramters fio.7(t), Ap(t): 

^Ofl , ld(/l0.7(^))^05i[ ldi 3 d ](/io.7(i))^05r [ 3 tii 3 0d ](/io.7(0)^ 5 , >30d(/l0.7(0) I 

log ld (Ap(t))Jog [ld . 3d] {Ap(t))Jog [3dt30d] (Ap(t))Jog >30d (Ap(t)), (23) 
Decl T (t) 

3) Next step is the calculation of the coefficients C, C02.T, Ci2,t, C22from l(22|) and minimizing the 
functional: 
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n({C})= E (Ne log ,> u {t)-C + E C 02 , T Zo ffT (/ 10 . 7 (i)) + E C 12 , T % T (^(i))+ 
t T T 

N 

+C 2 2DedT(t) + E C 3+i ,icos(uJit + i^)) 2 
i = 1 



(24) 



At first step we suppose N = 0. 
4) According to the technique jKuklin et al,2000| we will increase N by 1 until some stop condition 



is reached, and repeat actions from step 3). We determine parameters Cs+n^i, ujn, ip^ for one periodic 
at each step based on functional l[24"|) minimum, using all the previous periodics coefficients Cz+kn,i , 
u>i<N, ^t<wand parameters C, Co2,t, Ci2,T, C22 determined at previous steps. 

The only problem for using this approach is to choose correct stop condition (we could increase N up 
to infinity). Lets analyze the problem of stop condition more detailed. 



4 Determination of periodical components - two approaches to 
the modelling. 

There are at least two close but not similar approaches for solving this task. Of couse we are ordering 
parameters by their effectivity (energy of corresponding model component) . 

Approach 1, or description aproach: to obtain a maximal accuracy for description of the data. 

It is used most frequently. It is clear, that when using large enough set of periodical functions, we 
could approximate almost any function as accurate as required. The most effective way to demonstrate 
it is Fourier transform, that completes this task exactly. That is why in this approach the number of 
periodics could not be limited by the stated condition - we always could add next periodic and obtain 
higher accuracy of the model. So usually one could limit number of periodics by hands (for example 
- calculate only 3, 5 or 10 periodics) or stop when amplitude of periodic becomes lower than somehow 
defined noise of the data. 

Approach 2, or prediction approach: to obtain a maximal accuracy for prediction of the data. 

In opposite to the description approach, for prediction approach we should divide dataset into two non- 
intersecting intervals - analysis interval and prediction interval. All the model coefficients are defined from 
the analysis interval, but stop condition is defined from prediction interval, to the accuracy at prediction 
interval would be as effective as possible. This approach automatically limits number of periodics by not 
taking noise components into consideration. It is clear, that due to data noise, non-stationarity of the 
data and effects not included into the model an error will start to increase after the number of periodics 
exceeds some value. While the number of periodics less than this border number, the model remains 
adequate. When it exceeds this border number, the model becomes inadequate. There are at least two 
ways to calculate this stop condition within prediction approach. The first way is to stop when first 
local minimum of the error is reached at prediction set (absolute stop condition). The second way is 
to stop when a local minimum of relation between error on prediction set and error on analysis set is 
reached (relative stop condition) . We will use the second variant for analysis of Irkutsk data in prediction 
approach. 



5 Local electron density models based on Irkutsk digisonde data. 

For making the model the data from Irkutsk digisonde DPS-4 |Reinisc h,1997| has been analyzed. The 
data covers more than 4 years, from 2003 till 2007. From the data analysis there was excluded the most 
magneticaly disturbed periods with Ap > 100. The experimental data for electron density logarithm, 
averaged by days, are shown at fig.l. 
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Fig.l Logarithm of electron density, averaged over 24 hours for heights 120-240 km. Irkutsk digisonde data. 



Even from the shown data it is clear that the modulation model for electron density is a very good 
approximation: as one can see, the variations with different periods look independent and have amplitudes 
independent on time (in average). 

For testing the technique we have divided the whole set fo the data into two subsets: analyze set - 
2003-2006 years and prediction set - 2007 year. 

At fig. 2 the comparison between experiment and description model is shown. The model shown at 
fig. 2 has been built without any additional periodics (N = 0). At table 1 the accuracy of this model over 
the two different sets - analyze and prediction ones (columns 2-3) is shown. As one can see, the accuracy 
for prediction set is a little bit higher than for analyze set. It looks strange, but could be explained: 
absolute values of the electron density decreases with time, as can be seen from the fig. 2 (2007 year is 
close to solar activity minimum). The order of density changes is about 30% and this corresponds to the 
observed difference between prediction and analyze sets accuracy. 
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Fig 2. Comparison between the experiment and description model for 200 km height (top) and 120km height 
(bottom). N=0 

At fig. 3 a comparison of the experimental data with another description model (with big number of 
additional periodics , N = 50) is shown. At the table 1 the accuracy for this model over the two different 
sets - analyze and prediction ones (columns 4-5) is shown . 
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Fig 3. Comparison between the experiment and description model for 200 km height (top) and 120km height 
(bottom). N=50 



At fig. 4 a comparison of the experimental data with prediction model is shown. At table 1 the accuracy 
for this model in two different sets - analyze and prediction ones (columns 6-7) is shown. A number of 
additional periodics is shown at column 8. 
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Fig 4. Comparison between the experiment and prediction model for 200 km height (top) and 120km height 
(bottom). 



Table L Model accuracy depending on model, data set and height. 



Height 


analyze (0) 


preidction(O) 


analyze(50) 


prediction(50) 


analyze(opt) 


prediction(opt) 


N opt 


120 


9.6% 


11.2% 


5.5% 


11% 


9.1% 


8.8% 


4 


140 


10% 


12.1% 


7% 


17% 


10.9% 


12.3% 


2 


160 


14.4% 


9.3% 


7.6% 


19% 


12.1% 


12.3% 


4 


180 


19.8% 


10.8% 


8.5% 


24% 


15.8% 


15.8% 


5 


200 


25.4% 


20% 


10.7% 


32.5% 


22.8% 


23.2% 


3 


220 


24% 


14.8% 


10.2% 


26.6% 


19.2% 


21.3% 


6 


240 


22.1% 


14.5% 


9.2% 


23.8% 


17% 


19.4% 


7 



Column 1. - Height, km; Column 2. - descirption model, error over analyze set(2003-2006 years), N=0; Column 
3. - descirption model, error over prediction set(2007 year),N=0; Column 4. - descirption model, error over analyze 
set(2003-2006 years), N=50; Column 5. - descirption model, error over prediction set(2007 year),N=50; Column 
6. - prediction model, error over analyze set(2003-2006 years), N from stop condition; Column 7. - prediction 
model, error over prediction set(2007 year),N from stop condition; Column 8. Number of additional periodics for 
prediction model. 



From the table 1 one can see that description model provides a better accuracy over analyze set than 
prediction model over analyze set (columns 4,6). 

From the data also one can see that accuracy of description model is increased with increasing number 
of parameters (columns 2,4). But, over prediction set of the data (not included into analysis) the predic- 
tion model could make better accuracy than description model with big number of parameters (columns 
5,7). So when making a model for good description of the ionospheric electron density variations we 
should take an exact set of parameters. Both small number of parameters and big number of parameters 
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could increase the error when using it over prediction set of the data, i.e. when using the model for 
prediction purposes. 

From the analysis one could also see that with increasing of the height the number of additional 
periodical variations that should be taken into account (column 8) increases. 



6 Conclusion 

In the paper the step-by-step principles for making local model of electron density f22|) are described. 
They are based on modulation principle - electron density dependence on time is a product of temporal 
variations caused by different sources: solar radiation, magnetic activity, Earth orientation and unknown 
additional periodical processes (but not a sum, as sometimes suppose when making local models). A 
multiranges modulation principle is also described, that allows automatically extend set of parameters 
by using new indexes obtained by filtration (or averaging) basic set of parameters over the time. In the 
paper we have described two approaches to the model creation - descriptional and predictional ones. 

To test the approach a three different models were created for daily electron density logarithm using 
the stated principles. We have used the data of Irkutsk digisonde over the period 2003-2007 years for 
testing. It becomes clear that non-optimal choice of the number of model parameters could increase 
prediction error, inspite the error over the set, used for analysis, will decrease. It is shown that one year 
prediction has accuracy about 9-23% depending on the height, and the highest error corresponds to the 
height about 200km. From the modelling we can also see that with increasing of the height the number 
of parameters increases, and this could be caused by inaccuracy of the model or by not including an 
additional physical mechanisms into consideration. 

Author thanks to L.A.Schepkin, N.V.Ilyin and K.G.Ratovsky for useful discussions, and to A.A.Berngardt 
for help in preparing this publication. 

The work was done under support of RFBR grant #05-07-90212-v 
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